RAW SEQUENCE LISTING DATE: 08/20/2004 

PATENT APPLICATION: US/10/817 , 418 TIME: 10:44:03 



ENTERED 



Input Set : A:\38002071.app 

Output Set: N: \CRF4\08202004\J817418 • raw 

3 <110> APPLICANT: YANG, JIANXIN 

5 <120> TITLE OF INVENTION: GENE AMPLIFICATION AND OVEREXPRESSION IN CANCER 
7 <130> FILE REFERENCE: 38002-0071 
9 <140> CURRENT APPLICATION NUMBER: 10/817,418 
10 <141> CURRENT FILING DATE: 2004-04-05 

12 <150> PRIOR APPLICATION NUMBER: 60/462,702 

13 <151> PRIOR FILING DATE: 2003-04-15 
15 <160> NUMBER OF SEQ ID NOS : 23 
17 <170> SOFTWARE: Patentin Ver . 3.2 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 3001 

21 <212> TYPE: DNA ^ 

22 <213> ORGANISM: Homo sapiens * ' 

24 <400> SEQUENCE: 1 

25 tgagtaaatc gatacatcat acgcgcgctc ctctggccgc ccctccctcc gacgatcggg 60. 

2 6 gaccctggcg ggcggcagga ggacatggcc agcgacgccg tgcagagtga gcctcgcagc 120 
27 tggtccctgc tagagcagct gggcctggcc ggggcagacc tggcggcccc cggggtacag 180 

2 8 cagcagctgg agctggagcg ggagcggctg cggcgggaaa tccgcaagga gctgaagctg 24 0 

29 aaggagggtg ctgagaacct gcggcgggcc accactgacc tgggccgcag cctgggcccc 3 00 

30 gtagagctgc tgctgcgggg ctcctcgcgc cgcctcgacc tgctgcacca gcagctgcag 360 

31 gagctgcacg cccacgtggt gcttcccgac ccggcggcca cccacgatgg cccccagtcc 420 

32 cctggtgcgg gtggccccac ctgctcggcc accaacctga gccgcgtggc gggcctggag 480 

33 aagcagttgg ccattgagct gaaggtgaag cagggggcgg agaacatgat ccagacctac 540 

34 agcaatggca gcaccaagga ccggaagctg ctgctgacag cccagcagat gttgcaggac 600 

35 agtaagacca agattgacat catccgcatg caactccgcc gggcgctgca ggccgaccag 660 

36 ctggagaacc aggcagcccc ggatgacacc caagggagtc ctgacctggg ggctgtggag 72 0 

37 ctgcgcatcg aagagctgcg gcaccacttc cgagtggagc acgcggtggc cgagggtgcc 780 

38 aagaacgtac tgcgcctgct cagcgctgcc aaggccccgg accgcaaggc agtcagcgag 840 

3 9 gcccaggaga aattgacaga atccaaccag aagctggggc tgctgcggga ggctctggag 900 

40 cggagacttg gggagctgcc cgccgaccac cccaaggggc ggctgctgcg agaagagctc 96 0 

41 gctgcggcct cctccgctgc cttcagcacc cgcctggccg ggccctttcc cgccacgcac 1020 

42 tacagcaccc tgtgcaagcc cgcgccgctc acagggaccc tggaggtacg agtggtgggc 1080 

43 tgcagagacc tcccagagac catcccgtgg aaccctaccc cctcaatggg gggacctggg 1140 

44 accccagaca gccgcccccc cttcctgagc cgcccagccc ggggccttta cagccgaagc 12 00 

45 ggaagcctca gtggccggag cagcctcaaa gcagaagccg agaacaccag tgaagtcagc 1260 

46 actgtgctta agctggataa cacagtggtg gggcagacgt cttggaagcc atgtggcccc 132 0 

47 aatgcctggg accagagctt cactctggag ctggaaaggg cacgggaact ggagttggct 1380 

48 gtgttctggc gggaccagcg gggcctgtgt gccctcaaat tcctgaagtt ggaggatttc 1440 

49 ttggacaatg agaggcatga ggtgcagctg gacatggaac cccagggctg cctggtggct 1500 

50 gaggtcacct tccgcaaccc tgtcattgag aggattcctc ggctccgacg gcagaagaaa 1560 

51 attttctcca agcagcaagg gaaggcgttc cagcgtgcta ggcagatgaa catcgatgtc 1620 

52 gccacgtggg tgcggctgct ccggaggctc atccccaatg ccacgggcac aggcaccttt 1680 

53 agccctgggg cttctccagg atccgaggcc cggaccacgg gtgacatatc ggtggagaag 1740 
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54 ctgaacctcg gcactgactc ggacagctca cctcagaaga gctcgcggga tcctccttcc 1800 

55 agcccatcga gcctgagctc ccccatccag gaatccactg ctcccgagct gccttcggag 1860 

56 acccaggaga ccccaggccc cgccctgtgc agccctctga ggaagtcacc tctgaccctc 1920 

57 gaagatttca agttcctggc ggtgctgggc cggggtcatt ttgggaaggt gctcctctcc 198 0 

58 gaattccggc ccagtgggga gctgttcgcc atcaaggctc tgaagaaagg ggacattgtg 2 04 0 

59 gcccgagacg aggtggagag cctgatgtgt gagaagcgga tattggcggc agtgaccagt 2100 

60 gcgggacacc ccttcctggt gaacctcttc ggctgtttcc agacaccgga gcacgtgtgc 2160 

61 ttcgtgatgg agtactcggc cggtggggac ctgatgctgc acatccacag cgacgtgttc 2220 

62 tctgagcccc gtgccatctt ttattccgcc tgcgtggtgc tgggcctaca gtttcttcac 22 80 

63 gaacacaaga tcgtctacag ggacctgaag ttggacaatt tgctcctgga caccgagggc 2340 

64 tacgtcaaga tcgcagactt tggcctctgc aaggagggga tgggctatgg ggaccggacc 2400 

65 agcacattct gtgggacccc ggagttcctg gcccctgagg tgctgacgga cacgtcgtac 246 0 

66 acgcgagctg tggactggtg gggactgggt gtgctgctct acgagatgct ggttggcgag 2 52 0 

67 tccccattcc caggggatga tgaggaggag gtcttcgaca gcatcgtcaa cgacgaggtt 2580 

68 cgctaccccc gcttcctgtc ggccgaagcc atcggcatca tgagaaggct gcttcggagg 2640 

69 aacccagagc ggaggctggg atctagcgag agagatgcag aagatgtgaa gaaacagccc 2700 

70 ttcttcagga ctctgggctg ggaagccctg ttggcccggc gcctgccacc gccctttgtg 2760 

71 cccacgctgt ccggccgcac cgacgtcagc aacttcgacg aggagttcac cggggaggcc 2 82 0 

72 cccacactga gcccgccccg cgacgcgcgg cccctcacag ccgcggagca ggcagccttc 2880 

73 ctggacttcg acttcgtggc cgggggctgc tagccccct^c ccctgcccct gcccctgccc 2940 

74 ctgcccgaga gctcttagtt tttaaaaagg cctttgggat ttgccggaaa aaaaaaaaaa 3 00 0 

75 a 3001 

78 <210> SEQ ID NO: 2 

79 <211> LENGTH: 942 

80 <212> TYPE: PRT 

81 <213> ORGANISM: Homo sapiens 
83 <4 00> SEQUENCE: 2 

Met Ala Ser Asp Ala Val Gin Ser Glu Pro Arg Ser Trp Ser Leu Leu 
1 5 



84 
85 



10 



15 



87 Glu Gin Leu Gly Leu Ala Gly Ala Asp Leu Ala Ala Pro Gly Val Gin 



88 



20 



25 



30 



90 Gin Gin Leu Glu Leu Glu Arg Glu Arg Leu Arg Arg Glu lie Arg Lys 

91 35 '40 45 

93 Glu Leu- Lys Leu Lys Glu Gly Ala Glu Asn Leu Arg Arg Ala Thr Thr 

94 50 55 60 

96 Asp Leu Gly Arg Ser Leu Gly Pro Val Glu Leu Leu Leu Arg Gly Ser 

97 65 70 75 80 

99 Ser Arg Arg Leu Asp Leu Leu His Gin Gin Leu Gin Glu Leu His Ala 

100 85 90 95 

102 His Val Val Leu Pro Asp Pro Ala Ala Thr His Asp Gly Pro Gin Ser 

103 100 105 110 

105 Pro Gly Ala Gly Gly Pro Thr Cys Ser Ala Thr Asn Leu Ser Arg Val 

106 115 120 125 

108 Ala Gly Leu Glu Lys Gin Leu Ala lie Glu Leu Lys Val Lys Gin Gly 

109 130 135 140 

111 Ala Glu Asn Met lie Gin Thr Tyr Ser Asn Gly Ser Thr Lys Asp Arg 

112 145 150 155 160 

114 Lys Leu Leu Leu Thr Ala Gin Gin Met Leu Gin Asp Ser Lys Thr Lys 

115 165 170 175 
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264 <212> TYPE: DNA 

265 <213> ORGANISM: Homo sapiens 

267 <400> SEQUENCE: 3 

268 atggccagcg acgccgtgca gagtgagcct cgcagctggt ccctgctaga gcagctgggc 60 

269 ctggccgggg cagacctggc ggcccccggg gtacagcagc agctggagct ggagcgggag 120 

270 cggctgcggc gggaaatccg caaggagctg aagctgaagg agggtgctga gaacctgcgg 180 
2 71 cgggccacca ctgacctggg ccgcagcctg ggccccgtag agctgctgct gcggggctcc 240 
2 72 tcgcgccgcc tcgacctgct gcaccagcag ctgcaggagc tgcacgccca cgtggtgctt 3 00 
2 73 cccgacccgg cggccaccca cgatggcccc cagtcccctg gtgcgggtgg ccccacctgc 360 
274 tcggccacca acctgagccg cgtggcgggc ctggagaagc agttggccat tgagctgaag 42 0 
2 75 gtgaagcagg gggcggagaa catgatccag acctacagca atggcagcac caaggaccgg 48 0 
2 76 aagctgctgc tgacagccca gcagatgttg caggacagta agaccaagat tgacatcatc 54 0 
2 77 cgcatgcaac tccgccgggc gctgcaggcc gaccagctgg agaaccaggc agccccggat 600 
2 78 gacacccaag ggagtcctga cctgggggct gtggagctgc gcatcgaaga gctgcggcac 660 
2 79 cacttccgag tggagcacgc ggtggccgag ggtgccaaga acgtactgcg cctgctcagc 720 

280 gctgccaagg ccccggaccg caaggcagtc agcgaggccc aggagaaatt gacagaatcc 780 

281 aaccagaagc tggggctgct gcgggaggct ctggagcgga gacttgggga gctgcccgcc 840 

282 gaccacccca aggggcggct gctgcgagaa gagctcgctg cggcctcctc cgctgccttc 900 

283 agcacccgcc tggccgggcc ctttcccgcc acgcactaca gcaccctgtg caagcccgcg 960 

284 ccgctcacag ggaccctgga ggtacgagtg gtgggctgca gagacctccc agagaccatc 102 0 

285 ccgtggaacc ctaccccctc aatgggggga cctgggaccc cagacagccg Gccccccttc 1080 
2 86 ctgagccgcc cagcccgggg cctttacagc cgaagcggaa gcctcagtgg ccggagcagc 1140 
2 87 ctcaaagcag aagccgagaa caccagtgaa gtcagcactg tgcttaagct ggataacaca 1200 
2 88 gtggtggggc agacgtcttg gaagccatgt ggccccaatg cctgggacca gagcttcact 1260 

289 ctggagctgg aaagggcacg ggaactggag ttggctgtgt tctggcggga ccagcggggc 1320 

290 ctgtgtgccc tcaaattcct gaagttggag gatttcttgg acaatgagag gcatgaggtg 1380 

291 cagctggaca tggaacccca gggctgcctg gtggctgagg tcaccttccg caaccctgtc 1440 
2 92 attgagagga ttcctcggct ccgacggcag aagaaaattt tctccaagca gcaagggaag 15 00 

293 gcgttccagc gtgctaggca gatgaacatc gatgtcgcca cgtgggtgcg gctgctccgg 1560 

294 aggctcatcc ccaatgccac gggcacaggc acctttagcc ctggggcttc tccaggatcc 1620 
2 95 gaggcccgga ccacgggtga catatcggtg gagaagctga acctcggcac tgactcggac 1680 
296 agctcacctc agaagagctc gcgggatcct ccttccagcc catcgagcct gagctccccc 1740 

2 97 atccaggaat ccactgctcc cgagctgcct tcggagaccc aggagacccc aggccccgcc 1800 

298 ctgtgcagcc ctctgaggaa gtcacctctg accctcgaag atttcaagtt cctggcggtg 1860 

299 ctgggccggg gtcattttgg gaaggtgctc ctctccgaat tccggcccag tggggagctg 1920 

300 ttcgccatca aggctctgaa. gaaaggggac attgtggccc gagacgaggt ggagagcctg 1980 

301 atgtgtgaga agcggatatt ggcggcagtg accagtgcgg gacacccctt cctggtgaac 2040 

302 ctcttcggct gtttccagac accggagcac gtgtgcttcg tgatggagta ctcggccggt 2100 
ggggacctga tgctgcacat ccacagcgac gtgttctctg agccccgtgc catcttttat 2160 

304 tccgcctgcg tggtgctggg cctacagttt cttcacgaac acaagatcgt ctacagggac 2220 

305 ctgaagttgg acaatttgct cctggacacc gagggctacg tcaagatcgc agactttggc 2280 

306 ctctgcaagg aggggatggg ctatggggac cggaccagca cattctgtgg gaccccggag 2340 

3 07 ttcctggccc ctgaggtgct gacggacacg tcgtacacgc gagctgtgga ctggtgggga 2400 
308 ctgggtgtgc tgctctacga gatgctggtt ggcgagtccc cattcccagg ggatgatgag 2460 
3 09 gaggaggtct tcgacagcat cgtcaacgac gaggttcgct acccccgctt cctgtcggcc 252 0 

310 gaagccatcg gcatcatgag aaggctgctt cggaggaacc cagagcggag gctgggatct 2580 

311 agcgagagag atgcagaaga tgtgaagaaa cagcccttct tcaggactct gggctgggaa 2640 

312 gccctgttgg cccggcgcct gccaccgccc tttgtgccca cgctgtccgg ccgcaccgac 2700 

313 gtcagcaact tcgacgagga gttcaccggg gaggccccca cactgagccc gccccgcgac 2760 
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